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Abstract. We develop and analyze algorithms for dispersing a swarm of primitive 
robots in an unknown environment, R. The primary objective is to minimize the 
makespan, that is, the time to fill the entire region. An environment is composed of 
pixels that form a connected subset of the integer grid. There is at most one robot 
per pixel and robots move horizontally or vertically at unit speed. Robots enter R 
by means of k > 1 door pixels Robots are primitive finite automata, only having 
local communication, local sensors, and a constant-sized memory. 

We first give algorithms for the single-door case (i.e., k = 1), analyzing the 
algorithms both theoretically and experimentally We prove that our algorithms 
have optimal makespan 2 A — 1, where A is the area of R. 

We next give an algorithm for the multi-door case (k > 1), based on a wall- 
following version of the leader-follower strategy. We prove that our strategy is 
0(log(fc + l))-competitive, and that this bound is tight for our strategy and other 
related strategies. 



1 Introduction 

The objective of swarm robotics is to program a huge number of simple, tiny 
robots to perform complex tasks collectively. A typical application scenario 
for a robot swarm may involve two phases: first, the robot swarm fills an 
environment as quickly as possible while keeping the swarm "connected" 
(a term made more precise later). Once the robots are distributed in the 
environment, the robots perform world- embedded computational tasks, such 
as computing distances and short paths between points in the environment, 
finding chokepoints, mapping the environment, and searching for intruders. 

This paper develops efficient algorithms for filling an environment with 
a swarm of robots, thus focusing on the first phase described above. Be- 
cause target applications require thousands to millions of robots and current 
demonstration scenarios have at most hundreds of robots, it is crucial to 
develop algorithms that have explicitly stated performance guarantees, thus 
ensuring scalability. 

There has already been work on dispersion algorithms, but most disper- 
sion algorithms rely on greedy strategies such as go- for- free space, where 
robots move to fill unoccupied space, or artificial physics strategies, where 
neighboring robots exert "forces" on each other: repulsion forces if the robots 
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are closer than the target separation, and attraction forces if the neighboring 
robots are too far away. Although such robot behaviors often lead to reason- 
able dispersion, there are cases in which the swarm enters infinite loops, never 
filling the domain. Even in artificial physics strategies, which seem to disperse 
robots most efficiently, there are no guarantees on filling time. Furthermore, 
the swarm behavior under artificial-physics rules results in dispersions that 
are more reminiscent of molecular dynamics simulations (with large amounts 
of "bouncing" and "jitter") than of the ideal coordinated behavior of coop- 
erating teams of robots. 

The dispersion algorithms in this paper represent a departure from pre- 
viously studied dispersion strategies. First of all, we test our algorithms in 
arbitrary polygonal environments, rather than restricting ourselves to rect- 
angles or the infinite plane. It is important to study the dispersion algo- 
rithms in complicated polygonal domains because the target domains may 
be bottleneck-filled indoor environments or systems of pipes or ductwork. 

Our algorithms are based on follow-the-lcadcr local rules, where the robots 
form chains of leaders and followers emanating from a central source of robots, 
(the "doors"). The emergent behavior of our main algorithm is a left-hand- 
on-the-wall strategy, where the entire robot swarm hugs the left wall of the 
domain as the filling proceeds. We provide matching lower and upper bounds 
on the amount of time to fill as a function of the total size of the source. 
The difficulty in dispersing efficiently is in sustaining a large flow of robots 
coming in through the doors; the "chains" of robots coming in through the 
doors have a tendency to "cut each other off" ; see Figure g(lcft). Thus, we 
devise a set of local rules for splitting and splicing the robot chains. Our 
experience with our robot simulator shows that unless the local rules are 
precisely implemented, the flow of robots out of the source diminishes and 
perhaps even grinds to a halt prematurely. 




Fig. 1. Left: One turn can cut off most of the flow lines leading out of the door pixels 
(shaded grey): The left three columns of robots are trapped. Right: A polygonal 
domain encloses a discrete set of possible robot locations (pixels); special door pixels 
are shown in grey. 
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1.1 Our Results 

We develop and analyze algorithms for dispersing robot swarms in an un- 
known environment R. The primary objective is to minimize the makespan, 
that is, the time to fill the entire region. An environment is composed of 
squares or pixels that form a connected subset of the integer grid; such an 
environment is also called a maze. There is at most one robot per pixel and 
robots move horizontally or vertically at unit speed. There is a source of 
robots entering the environment through k > 1 doors. Robots are primi- 
tive finite automata, only having local communication, local sensors, and a 
constant-sized memory. The challenge is in proving that purely local strate- 
gies distributed over a swarm of agents can result in a predictable and prov- 
able behavior for the collective. 

This paper presents the following results: 

• We give filling algorithms for the single-door case (i.e., k = 1 ), analyzing 
the algorithms both theoretically and experimentally in terms of natural 
performance metrics, such as makespan and total traveled distance by 
all robots. Our algorithms are based on leader-follower strategies that 
are adapted from depth-first and breadth-first search to apply to robot 
swarms. We prove that our algorithms have optimal makespan 2 A — 1 , 
where A is the area (number of pixels) of R. 

• We give a filling algorithm for the multi-door case (k > 1), also based 
on a leader-follower strategy. The emergent behavior of the swarm is a 
filling strategy, called laminar flow, that generalizes the "left-hand-on- 
wall" strategy to multiple streams of robots coming in through multiple 
doors. An important contribution of the algorithm is the formulation of 
splitting and splicing strategies that maintain a large flow of robots out 
of the doors. We prove that the laminar flow algorithm is (9(log(fc + 1))- 
competitive, that is, the ratio of the makespan of our algorithm to the 
optimal makespan is (9(log(fc + f )). 

• We give a matching lower bound of !?(log(fc + 1)) on the competitiveness 
of the natural class of "simple-minded" strategies that use only strictly 
local information. 

• We report on experiments with a Java simulator that we built for swarms 
in grid environments. All of the algorithms in this paper are implemented 
in this simulator. 

While our results here are given purely in terms of robots moving syn- 
chronously on discrete grids, the results apply also to dispersing swarms of 
robots within an arbitrary connected planar domain, in the ideal setting in 
which the robots have perfect motion control and synchrony. More impor- 
tantly, our results may form the foundation for theoretical work on more 
realistic models of swarm robotics in the continuum, with asynchronous mo- 
tion, sensor errors, slippage, and hardware failures. 
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1.2 Prior and Related Work 

There has been considerable study recently of distributed control and coor- 
dination of a set of autonomous robots. Gage JlJ,|l5) has proposed the devel- 
opment of command and control tools for arbitrarily large swarms of micro- 
robots and has proposed coverage paradigms in the context of robot dispersal 
in an environment. Payton et al p0|]2l| ] propose the notion of "pheromone 
robotics" for world-embedded computation. Wagner et al. P, p7j]2g| , |2c| devel- 
oped multi-robot algorithms, directly inspired by ant behaviors, for searching 



and covering. Principe et al |13,22j and Suzuki et al ]12| , [25[|2q ] have studied 
algorithmic aspects of pattern formation in distributed autonomous robotics 
under various models of robots with minimal capabilities. The related flocking 
problem, which requires that a set of robots follow a leader while maintain- 
ing a formation, has been studied in several recent papers; see, e.g., p|,|6|,|l6| . 
Balch [|j has developed "Teambots", a Java-based general-purpose multi- 
robot simulator. 

There is a vast literature on algorithms for one or several robots to ex- 
plore unknown environments. The environments can be modeled as graphs 
(directed or undirected), mazes, or geometric domains, and the robots can 
have a range of computing powers; see, e.g., [p]j7|J^, ^0| , pl] | . Mitchell [jl9| in- 
cludes a survey of many results on exploring and navigating in geometric 
environments. 

Dispersion algorithms have been recently studied both in the context of 
multi-robot coverage and sensor network deployment; see ppl Ji^f^ . Methods 
include potential fields |l7],^3| , "artifical physics" , attraction-repulsion jl8) . 
and algorithms based on molecular dynamics || . While these studies have ex- 
amined empirically the merits of heuristics, they have not addressed formally 
the algorithmic efficiency of the dispersion problem. 



2 Preliminaries 



For an arbitrary connected region 1Z in the plane, we let R denote the set 
of pixels that lie entirely within 1Z. A pixel is an axis-aligned unit square, 
{(x, y) : i < x < i + l,j < y < j + 1}, whose lower left corner is given by 
the integer grid point Two pixels are neighbors if they share a common 

edge; thus, pixel (i,j) has four neighbors, {i — {i + (i,j — 1), 

and + 1). We refer to R as the environment, and we assume that it is 
connected. A subset, D C R, of the pixels are doors, which serve as sources 
of incoming robots. See Figure [l](right). 

A robot r is said to occupy a pixel p — (i,j) when it is located in that 
pixel. We assume that at most one robot can occupy any one pixel at any 
given time t. We let p{r, t) denote the pixel occupied by robot r at time t. We 
let prev(r, t) = p(r, t — 1) denote the pixel previously occupied by robot r. 
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Time is discretized into steps t = 0, 1, 2, 3, The positions of the robots 

at time t are given by the set S(t), where D C S(t) C _R. At time 0, there is 
one robot in each door: 5(0) = D. 

Each pixel is in one of three states at time t: (1) the pixel is an obstacle if 
it does not lie in R; (2) the pixel is occupied if it is in R and a robot occupies 
it; or (3) the pixel is unoccupied if it is in R and no robot occupies it at time 
t. An unoccupied pixel at time t is classified as either previously occupied, if 
it was occupied by some robot at some time prior to t, or frontier, if it has 
never been occupied. 

Robots have sensors that detect information about the local environment. 
In particular, if a robot occupies pixel (i,j) at time t, then we assume that 
it can detect the state of each pixel within a distance rg of (i, j), where rg 
is the (fixed) sensor radius. Robots have no global sensor (e.g., a GPS) and 
no knowledge of the coordinates of the pixels they occupy. 

We assume that each robot has a small finite memory that allows it to 
remember the sensor readings of the last T > 1 time steps; i.e., a robot 
knows the sensor readings it has taken at time t — T, t — T + 1,. . . , t — 1, t. In 
particular, each robot knows which nearby pixels have been occupied recently 
by other robots. The algorithms in this paper have an 0(l)-bit memory; thus, 
the robots have the computational power of finite automata. 

We assume that robots have a limited ability to communicate with nearby 
robots. In particular, at any given time step a robot is able to exchange a 
constant-size message with any robot that lies within a prescribed commu- 
nication radius, rc, of the robot. The communication graph, Q{t), of the 
swarm S(t) at time t is defined to be the undirected graph whose nodes are 
the robots S(t) and whose edges link pairs of robots that communicate. The 
swarm is said to be connected at time t if each connected component of Q (t) 
contains at least one door pixel. 

Robots move discretely on the grid of pixels. If at time t a robot occupies 
pixel (i,j) of R, and a neighboring pixel (say, (i + ) of R is unoccupied 
at time t, then the robot may take a step to the right, so that at time t + 1 
it occupies pixel (i + Naturally, no two robots can move into the same 
pixel, since no pixel can be occupied by more than one robot. Thus, if two 
robots are occupying pixels that neighbor an empty pixel p of R, then the 
robots must negotiate which one of them will have the right of way to enter 
p. A simple way to handle this negotiation is to establish a priority among 
the four directions, e.g., north, south, east, west. The robot occupying the 
pixel with the highest priority with respect to p has first rights for moving 
to p. 

The above rules on occupying and vacating pixels force a constant delay 
in the motion. Suppose that robot r occupies pixel (i,j) at time t and vacates 
(i,j) at time No other robot r' can occupy pixel (i,j) during time t + 1; 
the earliest possible time that the pixel can be occupied is t + 2. In principal, 
we could vary these local rules. For example, we could allow robot r' to enter 
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pixel at timestep t + 1, if it enters in one direction while the current 

robot r leaves in the opposite direction. However, these rules would require 
a much higher degree of coordination among the robots, and the movement 
decisions of robots would have to be less local. Specifically, whether a given 
robot can enter one pixel may depend on whether a different robot far away 
leaves another pixel. Alternatively, we could build a longer delay into the 
movement rules, but the resulting filling algorithms would be essentially the 
same. Note that the delay has an immediate impact on the necessary sensor 
range: If we require a delay of two time steps, robots need at least a sensor 
range of two, in order to be able to keep track of predecessors and successors. 

A similar issue arises in the way new robots enter through the door. For 
simplicity, we assume that a door pixel is always occupied by a robot; as soon 
as the robot occupying a door pixel moves to an adjacent pixel, a new robot 
materializes at the door. One can consider there to be an infinite supply of 
robots on the "other side" of a door pixel, so that a new robot steps into the 
door whenever the robot that was there previously moves to another pixel of 
R. When a robot first appears at the door, we assume that the robot points 
north. 

We say that the robots have filled the region R at time t if S(t) = R; in 
this case, the robots are well dispersed. A strategy is a set of rules by which 
robots move, basing their movement decisions solely on the constant amount 
of information they sense and remember. The makespan of a strategy is the 
minimum time, t* , until the robots have filled the region R. 

3 Dispersion Strategies for a Single Door 

We begin with the case of a single door pixel s, which we call the source. We 
describe strategies based on the notion of "leaders" and "followers" . 

A robot r at time t at position p(r, t) is classified as moving (meaning that 
it is "active") or stopped (meaning that it no longer moves). If r is moving, 
it is classified as either a leader or a follower. 

In our leader-follower strategies, each robot r at time t has a successor 
robot, succ(r,t), that is following r and a predecessor robot, pred(r,t), that 
is leading r. If robot r is at the door at time t (p(r,t) = s), then we define 
succ(r, t) to be NIL; if r is a leader, then we define pred(r, t) to be NIL. 

At time t = there is a robot at the door s, which is designated as moving 
and as a leader. 

3.1 Depth-First Leader-Follower 

The depth-first leader-follower strategy (DFLF) is inspired by depth-first 
search in a graph. Specifically, at any given time t £ {0, 1, 2,3,.. .}, there is 
exactly one leader, r, which is on pixel p(r, t) and r is looking for a frontier 
pixel (one that has never been occupied by a robot). (Thus, necessarily the 
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pixel from which the robot came, prev(r, t), is not frontier at time t.) If there 
are two or more frontier pixels neighboring p(r, t), the leader selects any one 
of them as its next destination. 

If the leader has no frontier pixel next to it, it stops (its state changes 
to "stopped"), and it tells its successor, succ(r), to assume the leadership 
(succ(r)'s state changes to "leader") at the conclusion of this time step. (The 
predecessor completes its move before taking over the leadership.) If the 
leader has no predecessor (i.e., it is a robot at the door), then the algorithm 
halts. (In this section, successors and (non-Nil) predecessors are invariant 
over time, and thus t is removed from the argument.) 

Any robot r that is a follower (i.e., not the leader) simply follows its 
predecessor: at time t it steps next to prev(pred(r),t), the pixel previously 
occupied by the predecessor of r. Note that once a robot stops, it never moves 
again. Furthermore, at any point in time there is exactly one leader. 

When dealing with finite automata, we have to use particular care when 
implementing the leader-follower strategy. As each robot has only a finite 
number of states, it cannot keep track of the identities of other robots, nor 
carry its own "ID label" . Instead, we use spatial information to pursue the 
immediate predecessor. Each robot r has a "heading" , indicating the direc- 
tion in which it is moving. The predecessor pred(r) is the nearest robot in 
that direction. Similarly, each robot keeps track of its "tailing" , the direction 
from which it came. Thus, the successor is the nearest visible robot in that 
direction. Whenever a robot is about to change its heading, it signals this 
turn to its immediate successor. Heading and tailing are updated accord- 
ing to delay and sensor-range. This is reflected in the following lemma and 
corollary, which are proved by introduction on time. 

Lemma 1. If r is not the leader at time t, then prev(pred(r),t) is an un- 
occupied pixel neighboring p(r,t). Furthermore, the maximum (L\) distance 
between r and pred(r) is 2. 

Corollary 1. If a pixel is not occupied in two consecutive time steps t and 
t + 1, then it was never occupied before time t. (i.e., there are no "large gaps" 
of time in the occupation of a pixel.) 

A further consequence of the above lemma and corollary is that the DFLF 
strategy can be implemented with a sensor radius of r$ — 2 and a memory 
of T = 1 (i.e., each robot recalls its previous sensor reading). Now it is not 
hard to derive the following result: 

Theorem 1. The DFLF algorithm halts when all pixels are occupied by 
robots. The makespan of the DFLF algorithm is2A~l, where A is the number 
of pixels of the environment R. 

Proof. At each time step there is exactly one leader and that leader will 
do one of two things: (1) move to a frontier pixel, or (2) change its status 
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to stopped and transfer leadership to its predecessor. (If the leader has no 
predecessor, that is, it is at the door, the algorithm halts.) Since each non- 
door pixel can be changed from frontier to non-frontier at most once, and each 
pixel can have a robot "park" itself on top of it at most once, the number of 
steps before the algorithm halts is at most 2 A — 1. The algorithm does not 
stop earlier since there is always a leader and the leader must do one of the 
two actions ((1) or (2)) above. □ 

Note that the factor 2 is a result of using a delay of 2. It is easy to see 
that this is best possible. 

The total distance traveled by all robots in a filling using DFLF is at 
most A 2 , since each robot steps at most A. We note that there are examples 
in which any dispersion strategy will require fl (A 2 ) total travel. One trivial 
example is a 1 x A rectangle, with the door pixel at one end. A trivial lower 
bound on the total travel for any given instance is the sum of the distances 
from each pixel of R to the door. 

DFLF may use substantially more total travel than an optimal (mini- 
mum makespan) strategy that minimizes total travel; consider, for example, 
a square with side lengths A 1 / 2 , for which DFLF requires total travel of A 2 
while an optimal strategy achieves total travel 0(A 3 / 2 ). 

3.2 Breadth-First Leader-Follower 

We now describe an alternative dispersion algorithm, the breadth-first leader- 
follower (BFLF) strategy. BFLF often has advantages over the DFLF strategy 
in terms of other metrics of performance (total travel of the robots, maximum 
travel of any one robot, total relative distance, etc), while still being optimal 
in terms of makespan (2A — 1). 

The BFLF strategy is substantially more complex than the DFLF, as it 
is not trivial to implement breadth-first search with local decision rules on 
(finite-automata) robots; indeed, our strategy does not perform breadth-first 
search, but it "approximates" breadth-first search sufficiently well to have 
some similar properties. 

As before, a robot can be in a "moving" state or a "stopped" state; 
however, we now introduce a third state, the waiting state, which is an interim 
state when a robot pauses temporarily and waits to be able to move. Once 
a robot is in a stopped state, it never moves again. In the BFLF strategy 
we can have multiple leaders, while in DFLF we have only one. As described 
for DFLF, we use a limited amount of spatial information to keep track 
of successors and predecessors. Other local rules are more complex and are 
described in the following. 

Initially, there is one leader robot — the robot at the door, s. When a 
robot r materializes at the door, it chooses to follow the previous robot that 
left the door. A leader always attempts to go to a neighboring frontier pixel, 
but makes sure it does not stray too far from its follower. If there are no 
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neighboring frontier pixels, the leader waits for the immediate follower to 
catch up. As soon as the immediate follower catches up, the leader becomes 
stopped and passes on the leadership role to the immediate follower. If the 
leader r at pixel u has a choice among neighboring frontier pixels, it picks any 
one of them to be its heading. If other frontier pixels adjacent to u remain 
unclaimed by other robots following different branches of dispersion, then the 
follower r' of r will choose one of these pixels as its heading when it arrives at 
it; if there remains a frontier pixel adjacent to u, then r"s follower r" chooses 
this pixel as its heading when it arrives at u. Here, r' and r" are relabeled 
"leaders" and pixel u is marked as a branching point. 

A robot r is only allowed to move at time t, if it satisfies the Movement 
Criterion (MC): A robot currently occupies prev(r, t) (its "parent" position) 
or a robot is currently heading for prev(r, t). If the MC is satisfied, the robot 
r moves to its heading; otherwise, it remains at its current position, p(r, t) 
but is still heading for its target pixel. As we will see below, a robot r is never 
blocked by its immediate predecessor except for the time step at which r first 
appears at the door or a time when its immediate predecessor is a leader with 
nowhere to go, meaning that the leadership will be passed to r. 

The BFLF strategy tries to create as many paths as possible at all times. 
The visited pixels form a tree that guides the directions that robots move; 
thus, pixels have parents and grandparents. At branches in the tree (pixels 
with > 2 children), robots alternate the direction that they travel. Specifi- 
cally, when a robot r leaves a pixel at time t, it tells its immediate follower 
r' = succ(r,t) what r"s immediate destination should be. Branching enables 
robots arriving at the same pixel at different times to go in different direc- 
tions, thereby balancing the flow. 

Lemma 2. // not all pixels are occupied, then some robot can move. 

The following structural lemma shows that the BFLF algorithm is non- 
blocking, i.e., a robot r is never "delayed" by its predecessors, only by its 
followers; i.e., only the MC holds a robot back, not temporary blockages 
downstream. Correctness follows by induction on the height of the tree. 

Lemma 3. // a robot is at pixel v and a robot is at pixel u, the non-root 
parent of v, then the robot at v is stopped. 

Corollary 2. A robot moves from the root every other time step, until all 
pixels are occupied. I.e., if a robot did not pop up at the root at time t, then one 
must pop up at time t+1, unless all pixels (including the root) are occupied. 

Lemma 4. The robots maintain a communication distance of 3, meaning 
that the algorithm works for r$ > 3. The maximum L\-distance from a robot 
to a follower is 3. Furthermore, at least one of the following pixels is occupied 
when a robot is at pixel u: (1) the parent of u, (2) the grandparent of u, or 
(3) the uncle of u. 
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Putting all the claims together and using similar reasoning as for Theo- 
rem 0, we get the same kind of bound on the makespan: 

Theorem 2. The BFLF algorithm halts when all pixels are occupied by robots. 
The makespan of the BFLF algorithm is 2 A — 1, where A is the number of 
pixels of the environment R. 

3.3 Experimental Comparison 

While the DFLF and BFLF strategies both have optimal makespan, they 
differ with respect to other performance metrics. We have implemented in 
Java a simulator for testing our dispersion strategies, while measuring various 
performance measures, including: (1) the depth of the tree of all paths from 
the source (i.e., the length of the longest path of a robot); (2) the average 
distance traveled by a robot during the dispersion; and (3) the average of 
the ratios, pi = li/dt, of the (Li) distance U from the door to robot i's final 
location to the total distance di traveled by robot i during the dispersion. 

We have compared the DFLF and BFLF strategies for the case of a sin- 
gle door, in a variety of different environments. Our results show that the 
average depth of the tree grows substantially more rapidly, as a function of 
the number of pixels, for DFLF than it does for BFLF; e.g., for an n-\yy-n 
square grid, BFLF tree depth grows linearly in n while DFLF depth grows 
close to quadratically in n. Similarly, the average distance traveled by a robot 
using DFLF grows more rapidly than using BFLF. The average of the ratios 
pi = li/di decreases with increasing problem size for DFLF, while it increases 
for BFLF. Details of the experimental results are deferred to the full paper. 

4 Dispersion with Multiple Doors 

We now consider the case in which there are k > 1 door pixels. In order to 
achieve rapid filling, the objective is to maintain a flow of robots out of as 
many doors as possible. The difficulty is that one robot chain out of one door 
pixel may unnecessarily block the flow of robots out of other door pixels. (See 
the introduction and Figure |l](left ) for examples.) 

4.1 Laminar Flow Leader- Follower (LFLF) Algorithm 

This section describes the Laminar Flow Leader- Follower (LFLF) algorithm. 
The LFLF algorithm maintains rapid flows by careful use of cutting and splic- 
ing. The emergent behavior of the swarm is a left-hand-on-wall strategy. The 
behavior of the algorithm is complex because bottlenecks in the environment 
may force the flow to divide into smaller flows that fill different regions and 
may merge and split. 

In the LFLF algorithm there are k multiple door pixels, s±, S2, • • • , Sfe. 
Note that LFLF matches DFLF algorithm if k = 1. As in the k = 1 case, 
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our dispersion strategies are based on leaders and followers. A robot r at 
time t at position p(r,t) is classified as active or inactive (stopped). If r is 
active, it is either a leader or a follower. However, unlike for k = 1, robots 
revert between leaders and followers. (In contrast, for k = 1 a robot changes 
roles from follower to leader but only relinquishes the leader role by stop- 
ping and becoming inactive.) Furthermore, in DFLF once a robot stops, it 
becomes inactive. In contrast, in LFLF a robot may temporarily stop while 
still remaining active. 

In LFLF (as in DFLF), robots have successors and predecessors, but in 
LFLF a robot's immediate successor or predecessor may change over time. 
Thus, each robot r at time t has a predecessor robot denoted robot- pred(r,t) 
and a successor robot denoted robot-succ(r, t). If robot r is at a door pixel at 
time t (p(r,t) = Si, for i G [1,&]), then robot-succ(r) = NIL; if r is a leader 
at time t, then robot-pred(r) = NIL. 

Here we further refine the roles of the pixels. A pixel can be: (1) an 
obstacle, (2) a frontier pixel (a pixel that has never been occupied), (3) an 
inactive pixel, that is, an (occupied) pixel that hosts an inactive robot, (4) an 
active pixel, (which may be occupied or not). If a pixel is unoccupied but 
previously occupied, then it is always active; if a pixel is occupied, then it is 
active if and only if the robot it hosts is active. 

Note that door pixels may be active or inactive, as described later. Active 
pixels play the role of transporting robots throughout the domain whereas 
inactive pixels are essentially obstacles. Analogous to the definition of prede- 
cessor and successor robots, each active pixel has predecessor and successor 
pixels. In particular, pixel u at time step t has successor pixel pixel-succ(u, t) 
and predecessor pixel pixel-pred(u,t). For door pixels Sj at any time t > 
pixel-succ(u, t) = NIL. 

A leader pixel u at time t has pixel-pred(u, t) = NIL. As we will see, if a 
leader pixel contains a robot, then the robot is a leader, however the leader 
robot may not be on a leader pixel and the leader pixel may not contain 
robots. We call a chain of predecessor pixels starting from the door pixel Si 
and ending at a leader pixel, a flow line. 

We now define formally the left side and right side of the flow. To do so, we 
first present some intermediate definitions. Consider a pixel v at time t, and 
let u — pixel- succ(v,t) and w = pixel-pred(v,t). Let the incoming direction 
for v at time t be the vector vH and let the outgoing direction for v at time 
t be the vector vub. 

Consider the (vertical or horizontal) neighboring pixels, when we start 
from the incoming direction vH and rotate clockwise until reaching the out- 
going direction v id. Those neighboring pixels at intermediate (vertical or 
horizontal) orientations are defined to be on the left-hand side of v time t. 
Thus, the left-hand side may contain 0, 1, or 2 pixels. Any neighboring pixel 
that is not on the left-hand side, is on the right-hand side; this includes the 
pixels at the incoming and outgoing directions. 



12 Hsiang et al. 



Note that leader pixels do not have outgoing edges, invalidating this defini- 
tion. For a leader pixel, we define all neighboring pixels (except the successor 
pixel) to be on the left-hand side. 

The left side of a flow line is the union of the left sides of the non-leader 
pixels in the flow line. 

The essential idea the LFLF algorithm is to treat the left side of the flow 
line completely differently from the right side of the flow line. Specifically, we 
splice into the left-hand side but we cannot splice into the right. 



4.2 Splicing and the LFLF Algorithm 

Suppose that at time t a leader robot r a reaches a pixel u — p{r a , t). If u has 
an (unoccupied) predecessor pixel pixel-pred(u, t), then robot r a moves to this 
pixel. (Since r a is a leader robot, it will never have an occupied predecessor 
pixel.) 

Otherwise, pixel u = p(r a ,t) is a leader pixel. If u = p(r ai t) has no 
neighboring frontier pixels, then robot r a tries to pass on the leadership. 
Robot r a first looks for a neighboring active pixel v. If no such v exists, 
then r a becomes inactive and passes the leadership to its successor robot 
robot-succ(r a ,t). When r a becomes inactive, pixel u also becomes inactive 
and its successor pixel becomes the new leader pixel. 

If robot r a looks for a neighboring active pixel v, and such a v exists, 
then r a checks whether u = p{r ai t) is a left-hand neighbor of v. If this is not 
the case for any possible v, then again r a becomes inactive and passes the 
leadership to its successor robot robot- succ{r ai t). As before, when r a becomes 
inactive, pixel u also becomes inactive and its successor pixel becomes the 
new leader pixel. 

If there exists such a v, robot r a chooses the first such v sweeping clock- 
wise, starting from the incoming direction. We redefine the predecessor pixel 
pixel-pred(u,t +1) := v and thus successor pixel pixel- succ(v,t + 1) := u. 
Thus, the previous successor pixel of v becomes a new leader pixel. (Note 
that this new leader pixel may or may not have a robot on it.) Now r a iden- 
tifies its new leader; it follows predecessor pixels starting at v until it finds 
a robot and sets robot-pred(r a ,t + 1) = r\> and robot-succ(ri,,t + 1) = r a . 
Robot r a then passes the leadership to r c = robot-succ(rb,t). 
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Fig. 2. The LFLF algorithm: There are six door pixels in the lower left corner. 
Splicing enables the flow to turn a corner. 

4.3 Correctness of LFLF 

We now show that LFLF always fills an environment. During LFLF, we claim 
that the following invariant is maintained: The left side of a flow line consists 
of visited pixels and obstacles. This follows from the fact that, whenever a 
robot enters a pixel, it turns as far left as it can (left-hand-on-the-wall). The 
invariant implies the following lemma, which we use to show that the visited 
regions propagate out from obstacle pixels: 

Lemma 5. The union of boundary points separates the visited region from 
obstacles and the right side of flows. In other words, the unvisited region only 
touches a flow from its right. 

Another useful property of flow lines is the following: 

Lemma 6. Flows do not self-intersect. 

Lemma 7. Leadership passing does not change the total number of leader 
robots. Thus, the number of leader robots at any time step t is equal to the 
number of active doors. 

Theorem 3. The robots always fill environment R. 

Proof. When a robot becomes inactive, it is a leader and its surrounding 
pixels must all not be frontier, for otherwise r would advance to such a pixel 
and still be active. Therefore, if an active pixel u is next to a frontier pixel 
v, any robot advancing into u will not become inactive and some flow lines 
can advance into the empty region so it will be filled. □ 

In the next sections we analyze the makespan for this filling strategy. 

4.4 Competitive Analysis for Online Strategies 

For any strategy S, let ns(t) be the number of robots that enter through the 
doors during timestep t; we define ns(0) = k. 

Note that the inherent delay of 2 in robot movement means that ns(t) 
is a highly erratic function, so that in one timestep k robots might enter 
the doors, implying that in the next timestep no robots enter the doors. 
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In LFLF additional delays are caused by splicings, and therefore we cannot 
even say that a robot enters an active door pixel every other timestep. To 
understand why, consider Figure ^|. When one flow line splices into a different 
flow line, a robot may get delayed by one additional time step in order to 
obtain the required separation between robots. If many splicing are happening 
next to each other, this can cause an accumulated delay of up to k time units, 
which causes a "wave" of packed robots that propagates back towards the 
door pixels. Thus, a door pixel might temporarily hold off ejecting robots for 
many units without the door pixel becoming inactive. Whenever an active 
door pixel does not eject a robot we say there is a door-pixel delay. 

Fortunately, the following lemma bounds the number of splicings (and 
also the total number of door-pixel delays). 

Lemma 8. A pixel can only be used to initiate 0(1) splicings. 

Theorem 4. The LFLF strategy is 0(log(fc + 1))- competitive. 

Proof. Consider the times {ii}[=o^ fc+1 ^ > w here ti is the latest timestep during 
which at least k/2 l door pixels are active. We call times to,ti,t2, . . ■ , tn g(fc+i)i 
significant events. Consider the time interval from ti to tj+i, when at least 
k/2 l+1 door pixels are active. Thus, the total number of robots leaving the 
door plus door-pixel delays during this interval is at least (k/2 l+1 )(ti + i — ti). 

Now consider an optimal strategy 5*, and consider the cut x associated 
with the significant event ti of LFLF. Cut \ is the boundary between the 
occupied and frontier pixels and has size at most k/2 1 . A lower bound on 
the makespan of any strategy is the area "behind" \ (i- e -j the side of \ not 
containing the doors) divided by the length of x plus the amount of time 
required to send the flow from the door to reach x- Thus, the makespan, 
opt, of S* satisfies 

Summing over all significant events we obtain that the total filling time of 
LFLF is 0(opt • log(/c + 1)). □ 

If we disregard the issue of delays caused by pixels having to be fully 
vacated before being re-entered, we get a whole class called sensible strategies, 
defined by the following two conditions: 

(1) ns(t) is non-increasing in t; and 

(2) If ns(t) > ns(t + 1), then at time t the number of occupied pixels 
bordering on a frontier pixel is n$(t + 1). 

The above proof also implies the following; quite clearly, this result still 
holds if we extend the notion of sensible strategies to allow for some delays, 
as long as these only add a constant factor to the filling times. 

Theorem 5. Any sensible strategy is 0(log(^ + 1))- competitive. 
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Fig. 3. Left: An environment that is hard for both depth-first and breadth- 
first filling strategies. This example shows that LFLF and related strategies are 
J?(log(fc + 1)) competitive. Right: A scenario in which any simple-minded strategy 
is S7(log(k + 1)) competitive. 



4.5 Competitive Analysis of Simple-Minded Strategies 

Under certain natural assumptions that limit the power of strategies, it is 
possible to give matching lower bounds on the competitiveness. To give some 
intuition, consider first the class of examples shown in Figure ||(left), which 
shows that the laminar flow algorithm may take as much as f7(log(/c + 1)) 
times opt. 

Theorem 6. The LFLF algorithm is J7(log(fc + 1))- competitive. 

This lower bound can be generalized to a much larger class of strategies. 
We say that strategy S is simple-minded if the following condition holds: 
There is a constant C , such that we can only tell the number A' of pixels in 
a region R' C R when a set R" C R has been visited that has all pixels in R' 
within L\ distance C . 

Considering simple-minded strategies is quite natural in the context of 
robot swarms: If we assume we only "know" a region R! after each pixel has 
been seen by one of the robots, then the constant C corresponds precisely to 
the sensor range. The example of Figure ||(left) can be generalized as shown 
schematically in Figure |](right), which forms the basis for the proof of the 
following theorem: 

Theorem 7. Any simple-minded strategy is i?(log(fc + 1))- competitive. 

A particular subclass of simple-minded strategies is one that arises quite 
naturally in exploration problems: We say that a strategy is conservative 
if any pixel that has been "discovered", i.e., come within sensor range of a 
robot, must stay within sensor range of some robot. For a constant-size sensor 
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range, it is easy to prove the following statement, which has been claimed for 
a long time in a different context: 

Proposition 1. All conservative strategies are simple-minded. 

As the LFLF strategy is conservative by design, we conclude that the 
lower bound of Theorem [?] applies to LFLF. 
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